A Deeper Look at the Hessian Eigenspectrum of Deep Neural Networks and its Applications to Regularization
نویسندگان
چکیده
Loss landscape analysis is extremely useful for a deeper understanding of the generalization ability deep neural network models. In this work, we propose layerwise loss where surface at every layer studied independently and also on how each correlates to overall surface. We study by studying eigenspectra Hessian layer. particular, our results show that geometry largely similar entire Hessian. report an interesting phenomenon eigenspectrum middle layers are observed most eigenspectrum. maximum eigenvalue trace (both full layerwise) reduce as training progresses. leverage these observations new regularizer based Penalizing indirectly forces Stochastic Gradient Descent converge flatter minima, which shown have better performance. such can be leveraged penalize middlemost alone, yields promising results. Our empirical studies well-known nets across datasets support claims work.
منابع مشابه
Evolution in Groups: A deeper look at synaptic cluster driven evolution of deep neural networks
A promising paradigm for achieving highly efficient deep neural networks is the idea of evolutionary deep intelligence, which mimics biological evolution processes to progressively synthesize more efficient networks. A crucial design factor in evolutionary deep intelligence is the genetic encoding scheme used to simulate heredity and determine the architectures of offspring networks. In this st...
متن کاملLook at the Clinical Skills Center and its applications
Clinical skills centers are one of the potentials in medical universities. Despite the huge spending in our country, these centers are used incompletely and undesirable. These review articles look at these centers, its application in some Iranian universities and foreign universities, and present some suggestions for optimal use of these centers in nationwide universities
متن کاملA Deeper Look at the “Neural Correlate of Consciousness”
A main goal of the neuroscience of consciousness is: find the neural correlate to conscious experiences (NCC). When have we achieved this goal? The answer depends on our operationalization of "NCC." Chalmers (2000) shaped the widely accepted operationalization according to which an NCC is a neural system with a state which is minimally sufficient (but not necessary) for an experience. A deeper ...
متن کاملA Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks
Sarcasm detection is a key task for many natural language processing tasks. In sentiment analysis, for example, sarcasm can flip the polarity of an “apparently positive” sentence and, hence, negatively affect polarity detection performance. To date, most approaches to sarcasm detection have treated the task primarily as a text categorization problem. Sarcasm, however, can be expressed in very s...
متن کاملA Look at Parsing and Its Applications
This paper provides a brief introduction to recent work in statistical parsing and its applications. We highlight successes to date, remaining challenges, and promising future work.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i11.17142